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(57) Abstract 

Purified BMP-9 proteins and processes for producing them are disclosed. The proteins may be used in the treatment of 
bone and cartilage defects and in wound healing and related tissue repair. 
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BMP-9 COMPOSITIONS 

The present invention relates to a novel family of purified 

10 proteins designated BMP-9 proteins and processes for obtaining 
them. These proteins may be used to induce bone and/ or 
cartilage formation and in wound healing and tissue repair. 

The murine BMP-9 DNA sequence (SEQ ID NO: 1). and amino 
acid sequence (SEQ ID NO: 2) are set forth in Figure 1. Human 

IS BMP-9 sequence is set forth in Figure 3 {SEQ ID NO: 8 and SEQ 
ID NO: 9) . It is contemplated that BMP-9 proteins are capable 
of inducing the formation of cartilage and/or bone. BMP-9 
proteins may be further characterized by the ability to 
demonstrate cartilage and/or bone formation activity in the rat 

20 bone formation assay described below. 

Murine BMP-9 is characterized "by comprising amino acid 
#319 to #428 of Figure 1 (SEQ ID NO: 2 amino acids #1-110) . 
Murine BMP-9 may be produced by culturing a cell transformed 
with a DNA* sequence comprising nucleotide #610 to nucleotide 

25 #1893 as shown in Figure 1 (SEQ ID NO: 1) and recovering and 
purifying from the culture medium a protein characterized by 
the amino acid sequence comprising amino acid #319 to #428 as 
shown in Figure 1 (SEQ ID NO: 2) substantially free from other 
proteinaceous materials with which it is co-produced. 

30 Human BMP-9 is expected to be homologous to murine BMP-9 

and is characterized by comprising amino acid #1 (Ser, Ala, 
Gly) to #110 of Figure 3 (SEQ ID NO: 9) (Arg) . The invention 
includes methods for obtaining the DNA sequences encoding human 
BMP— 9 . This method entails utilizing the murine BMP— 9 

35 nucleotide sequence or portions thereof to design probes to 
screen libraries for the human gene or fragments thereof using 
standard techniques. Human BMP-9 may be produced by culturing 
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a cell transformed with the BMP-9 DNA sequence and recovering 
and purifying BMP-9 from the culture medium* The expressed 
protein is isolated , recovered, and purified from the culture 
medium* The purified expressed protein is substantially free 
5 from other proteinaceous materials with which it is co- 
produced, as. well as from other contaminants. The recovered 
purified protein is contemplated to exhibit cartilage and/or 
bone formation activity* The proteins of the invention may be 
further characterized by the ability to demonstrate cartilage 

10 and/or bone formation activity in the rat bone formation assay 
described below. 

Human BMP-9 may be produced by culturing a cell 
transformed with a DNA sequence comprising nucleotide #124 to 
#453" as shown in SEQ ID NO: 8 and recovering and purifying from 

15 the culture medium a protein characterized by the amino acid 
sequence of SEQ ID NO: 9 from amino acid #1 to amino acid #110 
substantially free from other proteinaceous materials with 
which it is co-produced. 

Another aspect of the invention provides pharmaceutical 

20 compositions containing a therapeutically effective amount of 
a BMP-9 protein in a pharmaceutically acceptable vehicle or 
carrier. BMP-9 compositions of the invention may be used in 
the formation of cartilage. These compositions may further be 
utilized for the formation of bone. BMP-9 compositions may 

25 also be used for wound healing and tissue repair. Compositions 
of the invention may further include at least one other 
therapeutically useful agent such as the BMP proteins BMP-1, 
BMP-2, BMP-3, BMP-4 r BMP-5, BMP-6, and BMP -7 disclosed for 
instance in PCT publications WO8a/002O5, WO89/10409, and 

30 W090/11366, and BMP-8 f disclosed in U.S. application Ser- No. 

07/641,204 filed January 15, 1991, Ser. No. 07/525,357 filed 
May 16, 1990, and Ser- No. 07/800,364 filed November 20, 1991. 

The compositions of the invention may comprise, in 
addition to a BMP— 9 protein, other therapeutically, useful 

3 5 agents including growth factors such as epidermal growth factor 
(EGF) , fibroblast growth factor (FGF) , transforming growth 
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factor (TGF-a and TGF-/3) , and insulin- like growth factor (IGF) . 
The compositions may also include an appropriate matrix for 
instance, for supporting the composition and providing a 
surface for bone and/or cartilage growth. The matrix may 
5 provide slow release of the osteoinductive protein and/or the 
appropriate environment for presentation thereof. 

The BMP-9 compositions may he employed in methods for 
treating a number of bone and/or cartilage defects, periodontal 
disease and various types of wounds. These methods, according 

10 to the invention, entail administering to a patient needing 
such bone and/or cartilage formation wound healing or tissue 
repair, an effective amount of a BMP- 9 protein. These methods 
may also entail the administration of a protein of the 
invention in conjunction with at least one of the novel BMP 

15 proteins disclosed in the co-owned applications described 
above. In addition, these methods may also include the 
administration of a BMP- 9 protein with other growth factors 
including EGF, FGF, TGF-a, TGF-0, and IGF. 

Still a further aspect of the invention are DNA sequences 

20 coding for expression of a BMP- 9. protein. Such sequences 
include the sequence of nucleotides in a 5' to 3' direction 
illustrated in Figure 1 (SEQ ID NO: 1) and Figure 3 (SEQ ID NO: 
8) or DNA sequences which hybridize under stringent conditions 
with the DNA sequences of Figure 1 or 3 and encode a protein 

25 having the ability to induce the formation of cartilage and/or 
bone. Finally, allelic or other variations of the sequences of 
Figure 1 or 3 , whether such nucleotide changes result in 
changes in the peptide sequence or not, are also included in 
the present invention. 

30 A further aspect of the invention includes vectors 

comprising a DNA sequence as described above in operative 
association with an expression control sequence therefor. 
These vectors may be employed in a novel process for producing 
a BMP-9 protein of the invention in which a cell line 

35 transformed with a DNA sequence encoding a BMP— 9 protein in 
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operative association with an expression control sequence 
therefor, is cultured in a suitable culture medium and a BHP-9 
protein is recovered and purified therefrom- This process may 
employ a number of known cells both prokaryotic and eukaryotic 
5 as host cells for expression of the polypeptide. 

Other aspects and advantages of the present invention will 
be apparent upon consideration of the following detailed 
description and preferred embodiments thereof . 

10 Brief Description of th^ Drawing 

FIG- 1 comprises DNA sequence and derived amino acid sequence 
of murine BHP-9 from clone ML14a further described below. 

FIG, 2 comprises DNA sequence and derived amino acid sequence 
15 of human BMP-4 from lambda XJ2QS-3 ATCC #40342. 

FIG. 3 comprises DNA sequence and derived amino acid sequence 
of human BMP-9 from X FIX/H6111 ATCC # 75252. 

20 detailed P ^sen-ittton of the Xnventjpn 

The murine BMP-9 nucleotide sequence (SEQ ID NO: 1) and 
encoded amino acid sequence (SEQ ID NO: 2) are depicted in 
Figure 1. Purified murine BMP-9 proteins of the present 
invention are produced by culturing a host cell transformed wth 

25 a DNA sequence comprising the DNA coding sequence of Figure 1 
(SEQ ID NO: 1) from nucleotide #610 to nucleotide #1893 and 
recovering and purifying from the culture medium a protein 
which contains the amino acid sequence or a substantially 
homologous sequence as represented by amino acid #3X9 to #428 

30 of Figure 1 (SEQ ID NO: 2) . The BMP-9 proteins recovered from 
the culture medium are purified by isolating them from other 
proteinaceous materials from which they are co-produced and 
from other contaminants present. 



35 
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Human BMP-9 nucleotide and amino acid sequence is depicted 
in SEQ ID No: 8 and 9. Mature human BMP— 9 is expected to 
comprise amino acid #1 (Ser, Ala, Gly) to #110 (Arg) . 

Human BMP— 9 may be produced by culturing a cell 
5 transformed with a DNA sequence comprising nucleotide #124 to 
#453 as shown in SEQ ID NO: 8 and recovering and purifying from 
the culture medium a protein characterized by the amino acid 
sequence of SEQ ID NO: 9 from amino acid #1 to amino acid #110 
substantially free from other proteinaceous materials with 

10 which it is co-produced. 

BMP-9 proteins may be characterized by the ability to 
induce the formation of cartilage. BMP-9 proteins may be 
further characterized by the ability to induce the formation of 
bone. BMP— 9 proteins may be further characterized by the 

15 ability to demonstrate cartilage and/or bone formation activity 
in the rat bone formation assay described below. 

The BMP-9 proteins provided herein also include factors 
encoded by the sequences similar to those of Figure 1 and 3 
(SEQ ID NO's: 1 and 8), but into which modifications are 

20 naturally provided {e.g. allelic variations in the nucleotide 
sequence which may result in amino acid changes in the 
polypeptide) or deliberately engineered. For example, 

synthetic polypeptides may wholly or partially duplicate 
continuous sequences of the amino acid residues of Figure 1 of 

25 Figure 3 (SEQ ID NO's: 2 and 9). These sequences, by virtue of 
sharing primary, secondary, or tertiary structural and 
conformational characteristics with bone growth factor 
polypeptides of Figure l and Figure 3 may possess bone growth 
factor biological properties in common therewith. Thus, they 

3 0 may be employed as biologically active substitutes for 
naturally-occurring BMP— 9 and other BMP— 9 polypeptides in 
therapeutic processes. 

Other specific mutations of the sequences of BMP-9 
proteins described herein involve modifications of 

35 glycosylation sites. These modifications may involve O-linked 
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or N-linked glycosylation sites. For instance, the absence of 
glycosylation or only partial glycosylation results from amino 
acid substitution or deletion at asparagine-lixticed 
glycosylation recognition sites. The asp aragine-1 inked 
5 glycosylation recognition sites comprise tripeptide sequences 
which are specifically recognized by appropriate cellular 
glycosylation enzymes. These tripeptide sequences are either 
asparagine-X-threonine or asparagine-X-serine, where X is 
usually any amino acid. A variety of amino acid substitutions 

10 or deletions at one or both of the first or third amino acid 
positions of a glycosylation recognition site (and/or amino 
acid deletion at the second position) results in non- 
glycosylation at the modified tripeptide sequence. 

The present invention also encompasses the novel DNA 

15 sequences, free of association with DNA sequences encoding 
other proteinaceous materials , and coding on expression for 
BMP-9 proteins. These DNA sequences include those depicted in 
Figure 1 or Figure 3 (SEQ XD NO's: 1 and 8) in a 5' to 3' 
direction and those sequences which hybridize thereto under 

20 stringent hybridization conditions [see, T. Maniatis et al r 
Molecular cloning (A Laboratory Manual). , Cold Spring Harbor 
Laboratory {1982), pages 387 to 389] and encode a protein 
having cartilage and/ or bone inducing activity. 

Similarly, DNA sequences which code for BMP-9 proteins 

25 coded for by the sequences of Figure 1 or Figure 3, but which 
differ in codon sequence due to the degeneracies of the genetic 
code or allelic variations (naturally-occurring base changes in 
the species population which may or may not result in an amino 
acid change) also encode the novel factors described herein. 

30 Variations in the DNA sequences of Figure 1 or Figure 3 (SEQ ID 
NO: 1 and 8) which are caused by point mutations or by induced 
modifications (including insertion, deletion, and substitution) 
to enhance the activity, half -life or production " of the 
polypeptides encoded are also encompassed in the invention - 

35 Another aspect of the present invention provides a novel 
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method for producing BMP-9 proteins. The method of the present 
invention involves culturing a suitable cell line, which has 
been transformed with a DNA sequence encoding a BMP-9 protein 
of the invention, under the control of known regulatory 
5 sequences. The transformed host cells are cultured and the 
BMP-9 proteins recovered and purified from the culture medium. 
The purified proteins are substantially free from other 
proteins with which they are co-produced as well as from other 
contaminants . 

10 Suitable cells or cell lines may be mammalian cells, such 

as Chinese hamster ovary cells (CHO) . The selection of 
suitable mammalian host cells and methods for transformation, 
culture, amplification, screening, product production and 
purification are known in the art. See, e.g., Gething and 

15 Sambrook, Nature r 293 ;62Q-625 (1981) , or alternatively, Kaufman 
et al, Wol. Cell. BjoJ^, 5(7) ; 1750-1759 (1985) or Howley et al, 
U.S. Patent 4,419,446. Another suitable mammalian cell line, 
which is described in the accompanying examples, is the monkey 
COS-l cell line. The mammalian cell CV-1 may also be suitable. 

20 Bacterial cells may also be suitable hosts. For example, 

the various strains of J£. coli (e.g., HB101, MC1061) are 
well-known as host cells in the field of biotechnology. 
Various strains of £. sufefcilis, Pseugp?nonas , other bacilli and 
the like may also be employed in this method. 

25 

Many strains of yeast cells known to those skilled in the 
art may also be available as host cells for expression of the 
polypeptides of the present invention. Additionally, where 
desired, insect cells may be utilized as host cells in the 

30 method of the present invention. See, e.g. Miller et al, 
Genetic Engineering . 8.: 277-298 (Plenum Press 1986) and 
references cited therein. 

Another aspect of the present invention provides vectors 
for use in the method of expression of these novel BMP— 9 

35 polypeptides. Preferably the vectors contain the full novel 
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DNA sequences described above which encode the novel factors of 
the invention. Additionally the vectors also contain 
appropriate expression control sequences permitting expression 
of the BMP-9 protein sequences. Alternatively , vectors 
5 incorporating modified sequences as described above are also 
embodiments of the present invention. The vectors may be 
employed in the method of transforming cell lines and contain 
selected regulatory sequences in operative association with the 
DNA coding sequences of the invention which are capable of 

10 directing the replication and expression thereof in selected 
host cells. Regulatory sequences for such vectors are Known to 
those skilled in the art and may be selected depending upon the 
host cells. Such selection is routine and does not form part 
of the present invention. 

15 A protein of the present invention, which induces 

cartilage and/ or bone formation in circumstances where bone is 
not normally formed, has application in the healing of bone 
fractures and cartilage defects in humans and other animals. 
Such a preparation employing a BMP-9 protein may have 

20 prophylactic use in closed as well- as open fracture reduction 
and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes 
to the repair of congenital, trauma induced, or oncologic 
resection induced craniofacial defects, and also is useful in 

25 cosmetic plastic surgery. A BMP-9 protein may be used in the 
treatment of periodontal disease, and in other tooth repair 
processes. Such agents may provide an environment to attract 
bone- forming cells, stimulate growth of bone-forming cells or 
induce differentiation of progenitors of bone-forming cells. 

30 BMP-9 polypeptides of the invention may also be useful in the 
treatment of osteoporosis. A variety of osteogenic, 
cartilage-inducing and bone inducing factors have been 
described. See, e.g. European patent applications 148 > 155 and 
169,016 for discussions thereof - 

35 
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The proteins of the invention nay also be used in wound 
healing and related tissue repair* The types of wounds 
include, but are not limited to burns r incisions and ulcers* 
(See, e.g. PCT Publication WO84/01106 for discussion of wound 
5 healing and related tissue repair) . 

It is further contemplated that proteins of the invention 
may increase neuronal survival and therefore be useful in 
transplantation and treatment of conditions exhibiting a 
decrease in neuronal survival. 

10 A further aspect of the invention is a therapeutic method 

and composition for repairing fractures and other conditions 
related to cartilage and/or bone defects or periodontal dis- 
eases. The invention further comprises therapeutic methods and 
compositions for wound healing and tissue repair. Such 

15 compositions comprise a therapeutically effective amount of at 
least one of the BMP-9 proteins of the invention in admixture 
with a pharmaceutical^ acceptable vehicle r carrier or matrix. 

It is expected that the proteins of the invention may act 
in concert with or perhaps synergist ically with other related 

20 proteins and growth factors. Further therapeutic methods and 
compositions of the invention therefore comprise a therapeutic 
amount of at least one BMP-9 protein of the invention with a 
therapeutic amount of at least one of the other BMP proteins 
disclosed in co-owned applications described above. Such 

25 combinations may comprise separate molecules of the BMP 
proteins or heteromolecules comprised of different BMP 
moieties. For example, a method and composition of the 
invention may comprise a disulfide linked dimer comprising a 
BMP-9 protein subunit and a subunit from one of the "BMP" 

30 proteins described above. A further embodiment may comprise a 
heterodimer of BMP-9 moieties. Further, BMP-9 proteins may be 
combined with other agents beneficial to the treatment of the 
bone and/or cartilage defect, wound, or tissue in question. 
These agents include various growth factors such as epidermal 

35 growth factor (EGF) , platelet derived growth factor (PDGF) , 
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transforming growth factors (TGF-a and TGF-0) / and insulin-like 
growth factor (IGF) . 

The preparation and formulation of such physiologically 
acceptable protein compositions, having due regard to pH, 
5 isotonicity, stability and the like, is within the skill of the 
art. The therapeutic compositions are also presently valuable 
for veterinary applications due to the lack of species 
specificity in BMP proteins. Particularly domestic animals and 
thoroughbred horses in addition to humans are desired patients 

10 for such treatment with BMP-9 of the present invention - 

The therapeutic method includes administering the 
composition topically, systemically, or locally as an implant 
or device. When administered, the therapeutic composition for 
use in this invention is, of course, in a pyrogen-free, 

15 physiologically acceptable form. Further, the composition may 
desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. 
Topical administration may be suitable for wound healing and 
tissue repair. Therapeutically useful agents other than the 

20 BMP-9 proteins which may also optionally be included in the 
composition as described above, may alternatively or 
additionally, be administered simultaneously or sequentially 
with the BMP composition in the methods of the invention. 
Preferably for bone and/or cartilage formation, the 

25 composition would include a matrix capable of delivering BMP-9 
or other BMP proteins to the site of bone and/ or cartilage 
damage, providing a structure for the developing bone and 
cartilage and optimally capable of being resorbed into the 
body. The matrix may provide slow release of BMP-9 and/or the 

30 appropriate environment for presentation thereof- Such 
matrices may be formed of materials presently in use for other 
implanted medical applications. 

The choice of matrix material is based on 
biocompatibility , biodegradability , mechanical properties , 

35 cosmetic appearance and interface properties. The particular 
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application of the BMP-9 compositions will define the 
appropriate formulation. Potential matrices for the 

compositions may be biodegradable and chemically defined 
calcium sulfate, tricalciumphosphate, hydr oxyapat i te , 
5 polylactic acid and polyanhydrides . Other potential materials 
are biodegradable and biologically well defined , such as bone 
or dermal collagen. Further matrices are comprised of pure 
proteins or extracellular matrix components. Other potential 
matrices are nonbiodegradable and chemically defined, such as 

10 sintered hydroxyapatite, bioglass, alumina tes, or other 
ceramics* Matrices may be comprised of combinations of any of 
the above mentioned types of material, such as polylactic acid 
and hydroxyapatite or collagen and tricalciumphosphate. The 
bioceramics may be altered in composition, such as in calcium- 

15 aluminate-phosphate and processing to alter pore size, particle 
size, particle shape, and biodegradability. 

The dosage regimen will be determined by the attending 
physician considering various factors which modify the action 
of the BMP-9 protein, e.g. amount of bone weight desired to be 

20 formed, the site of bone damage, the condition of the damaged 
bone, the size of a wound, type of damaged tissue, the 
patient's age, sex, and diet, the severity of any infection, 
time of administration and other clinical factors. The dosage 
may vary with the type of matrix used in the reconstitution and 

25 the types of BMP proteins in the composition. The addition of 
other known growth factors, such as IGF I (insulin like growth 
factor I), to the final composition, may also effect the 
dosage* Progress can be monitored by periodic assessment of 
bone growth and/or repair, for example, x-rays, 

30 histomorphometric determinations and tetracycline labeling. 

The following examples illustrate practice of the present 
invention in recovering and characterizing murine BMP-9 protein 
and employing it to recover the human and other BMP-9 proteins, 
obtaining the human proteins and expressing the proteins via 

35 recombinant techniques. 
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EXAMPLE X 
Murine BMP-9 

750,000 recombinants of a mouse liver cDNA library made in 
the vector lambdaZAP (Stratagene/ Catalog #935302) are plated 
5 and duplicate nitrocellulose replicas made. A fragment of 
human BMP-4 DNA corresponding to nucleotides 1330-1627 of 
Figure 2 (SEQ ID NO: 3) (the human BMP-4 sequence) is ^P- 
labeled by the random priming procedure of Feinberg et al. 
[Anal. Biochem. 132: 6-13 (1983)] and hybridized to both sets 

10 of filters in SHB at 60°C for 2 to 3 days. Both sets of filters 
are washed under reduced stringency conditions (4X SSC, 0.1% 
SDS at 60°C) . Many duplicate hybridizing recombinants of 
various intensities (approximately 92) are noted. 50 of the 
strongest hybridizing recombinant bacteriophage are plaque 

15 purified and their inserts are transferred to the plasmid 
Bluescript SK (+/-) according to the in vivo excision protocol 
described by the manufacturer (Stratagene) . DMA sequence 
analysis of several recombinants indicate that they encode a 
protein homologous to other BMP proteins and other proteins in 

20 the TGF-0 family • The DNA sequence and derived amino acid 
sequence of one recombinant, designated MLl4a, is set forth in 
Figure 1. (SEQ ID NO: 1) 

The nucleotide sequence of clone MI»14a contains an open 
reading frame of 1284 bp, encoding a BMP-9 protein of 428 amino 

25 acids. The encoded 428 amino acid BMP-9 protein is 
contemplated to be the primary translation product as the 
coding sequence is preceded by 609 bp of 5' untranslated 
sequence with stop codons in all three reading frames. The 428 
amino acid sequence predicts a BMP-9 protein with a molecular 

30 weight of 48,000 daltons. 

Based on knowledge of other BMP proteins and other 
proteins within the TGF-0 family, it is predicted that the 
precursor polypeptide would be cleaved at the muitibasic 
sequence ARG— ARG— LYS— ARG in agreement with a proposed consensus 
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proteolytic processing sequence of ARG-X-X-ARG. Cleavage of 
the BMP-9 precursor polypeptide at this location would generate 
a 110 amino acid mature peptide beginning with the amino acid 
SER at position #319. The processing of BMP-9 into the mature 
5 form is expected to involve dimerization and removal of the N- 
terminal region in a manner analogous to the processing of the 
related protein TGF-0 [I*.E. Gentry, et al., Holec. s Cell, 
Biol. 8:4162 (1988); R. Derynck, et al. , Nature 316:701 
(1985)]. 

10 It is contemplated therefore that the mature active 

species of murine BMP-9 comprises a homodimer of 2 polypeptide 
subunits, each subunit comprising amino acids #3l9-#428 with a 
predicted molecular weight of approximately 12 r 000 daltons. 
Further active species are contemplated comprising amino acids 

15 #326 - #428 thereby including the first conserved cysteine 
residue. As with other members of the BMP and TGF-/S family of 
proteins, the car boxy-terminal region of the BMP-9 protein 
exhibits greater sequence conservation than the more amino- 
terminal portion. The percent amino acid identity of the 

20 murine BMP-9 protein in the cysteine-rich C-terminal domain 
{amino acids #326 - #428) to the corresponding region of other 
human BMP proteins and other proteins within the TGF-/3 family 
is as follows: BMP-2, 53%; BMP-3 , 43%; BMP-4, 53%; BMP-5, 55%; 
BMP-6, 55%; BMP-7, 53%; Vgl r 50%; GDF-1, 43%; TGF-01, 32%; TGF— 

25 02, 34%; TGF-03, 34%; inhibin0(B), 34%; and inhibin 0 (A) , 42%. 

EXAMPLE II 
Human BMP,— 9 

Murine and human osteoinductive factor genes are presumed 
30 to be significantly homologous , therefore the murine coding 
sequence or a portion thereof is used as a probe to screen a 
human genomic library or as a probe to identify a human cell 
line or tissue which synthesizes the analogous human cartilage 
and/ or bone protein. A human genomic library (Toole et al. , 
35 supra ) may be screened with such a probe, and presumptive 
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positives isolated and DNA sequence obtained. Evidence that 
tills recombinant encodes a portion of the human BMP-9 relies of 
the murine/human protein and gene structure homologies. 

Once a recombinant bacteriophage containing DNA encoding 
5 portion of the human cartilage and/or bone inductive factor 
molecule is obtained, the human coding sequence can be used as 
a probe to identify a human cell line or tissue which 
synthesizes BMP-9. Alternatively, the murine coding sequence 
can be used as a probe to identify such human cell line or 

10 tissue. Briefly described, RNA is extracted from a selected 
cell or tissue source and either electrophoresed on a 
formaldehyde agarose gel and transferred to nitrocellulose, or 
reacted with formaldehyde and spotted on nitrocellulose 
directly. The nitrocellulose is then hybridized to a probe 

15 derived from a coding sequence of the murine or human BMP-9. 
xnRKA is selected by oligo (dT) cellulose chromatography and 
cDNA is synthesized and cloned in lambda gtlO or lambda ZAP by 
established techniques (Toole et al., supra) . 

Additional methods known to those skilled in the art may 

20 be used to isolate the human and otfcer species' BMP— 9 proteins 
of the invention. 

A. isolation of Human DNft 

one million recombinants of a human genomic library 

25 constructed in the vector XFIX (Stratagene catalog # 944201) 
are plated and duplicate nitrocellulose replicas made. Two 
oligonucleotides probes designed on the basis of nucleotides 
#1665-#1704 and #1837-#1876 of the sequence set forth in Figure 
1 (SEQ ID N0:1) are synthesized on an automated DNA 

30 synthesizer. The sequence of these two oligonucleotides is 

indicated below: 

#1: CTATGAGTGTAAAGGGGGTTGCTTCTTCCCATTGGCTGAT 
#2: GTGCCAACCCTCAAGTACCACTATGAGGGGATGAGTGTGG 

These two oligonucleotide probes are radioactively labeled with 
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T^P-ATP and each is hybridized to one set of the duplicate 
nitrocellulose replicas in SHB at 65°C and washed with IX SSC r 
0.1% SDS at 65°C. Three recombinants which hybridize to both 
oligonucleotide probes are noted. All three positively 
5 hybridizing recombinants are plague purified, bacteriophage 
plate stocks are prepared and bacteriophage DNA is isolated 
from each. The oligonucleotide hybridizing regions of one of 
these recombinants, designated HGlll, is localized to a 1.2 kb 
Pst I/Xba I fragment. This fragment is subcloned into a 

10 plasmid vector (pGEM-3) and. DNA sequence analysis is performed* 
HGlll was deposited with the ATCC, 12301 Parklawn Drive, 
Rockville, Maryland USA on June 16, 1992 under the requirements 
of the Budapest Treaty and designated as ATCC # 75252. This 
subclone is designated pGEM-111. A portion of the DNA sequence 

15 of clone pGEM-111 is set forth in Figure 3 (SEQ ID NO: 8/ HUMAN 
BMP-9 sequence) . This sequence encodes the entire mature 
region of human BMP-9 and a portion of the propeptide. Xt 
should be noted that this sequence consists of preliminary 
data. Particularly, the propeptide region is subject to 

20 further analysis and characterization. For example, 

nucleotides #1 through #3 (TGA) encode a translational stop 
which may be incorrect due to the preliminary nature of the 
sequence. It is predicted that additional sequences present in 
both pGEM-111 (the 1.2 kb Pstl/Xbal fragment of HGlll subcloned 

25 into pGEM) and HGlll encode additional amino acids of the human 
BMP-9 propeptide region- Based on knowledge of other BMPs and 
other proteins within the TGF-0 family, it is predicted that 
the precursor polypeptide would be cleaved at the multibasic 
sequence ARG-ARG— LYS—ARG (amino acids # -4 through # -1 of 

30 SEQUENCE ID NO: 9) in agreement with a proposed consensus 
proteolytic processing sequence ARG-X-X-ARG. Cleavage of the 
human BMP-9 precursor polypeptide at this location would 
generate a 110 amino acid mature peptide beginning with the 
amino acid SER at position #1 of SEQUENCE ID NO: 9 (encoded by 
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nucleotides #124 through #126 of SEQUENCE XD HO: 8) . The 
processing of human BMP-9 into the mature form is expected to 
involve dimerization and removal of the N-terminal region in a 
manner analogous to the processing of the related protein TGF-0 
5 [L.E. Gentry, et al., Molec. & Cell* Biol. 8t4162 (1988); R* 
Derynck, et al., Nature 316:701 (1985)]. 

It is contemplated therefore that the mature active 
species of human BMP-9 comprises .a homodimer of two polypeptide 
subunits, each subunit comprising amino acids #1 through #110 

10 of SEQUENCE ID NO: 9, with a predicted molecular weight of 
12,000 daltons. Further active species are contemplated 
comprising amino acids #8 through #110 thereby including the 
first conserved cysteine residue. As with other members of the 
BMP and TGF-0 family of proteins, the carboxy- terminal portion 

15 of the human BMP-9 sequence exhibits greater sequence 
conservation than the amino-terminal portion- the percent 
amino acid identity of the human BMP-9 protein in the cysteine- 
rich C- terminal domain (amino acids #8 through #110) to the 
corresponding region of other human BMP proteins and other 

20 proteins within the TGF-0 family is as follows: BMP-2, 52%; 

BMP-3, 40%; BMP-4, 52%; BMF-5, 55%; BMP-6, 55%; BMP-7, 53%; 
murine BMP— 9 , 97%; Vgl, 50%; GDF-1, 44%; TGF-01, 32%; TGF-02, 
32%; TGF-03, 32%; inhibin 0 (B) , 35%; and inhibin £ (A), 41%. 

25 EXAMPLE III 

Rosen Modified Sam nath-Reddi Assay 

A modified version of the rat bone formation assay 
described in Sampath and Reddi, Proc. Natl, Acad. Scj. U,S.A^, 
80:6591-6595 (1983) is used to evaluate bone and/ or cartilage 

30 activity of the BMP proteins. This modified assay is herein 
called the Rosen-modified Sampath-Reddi assay. The ethanol 
precipitation step of the Sampath-Reddi procedure is replaced 
by dialyzing (if the composition is a solution) or diaf titer ing 
(if the composition is a suspension) the fraction to be assayed 

35 against water- The solution or suspension is then redissolved 
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in o.l % TFA, and the resulting solution added to 20xng of rat 
matrix, A mock rat matrix sample not treated with the protein 
serves as a control. This material is frozen and lyophili2ed 
and the resulting powder enclosed in #5 gelatin capsules. The 
5 capsules are implanted subcutaneously in the abdominal thoracic 
area of 21 - 49 day old male Long Evans rats. The implants are 
removed after 7-14 days. Half of each implant is used for 
alkaline phosphatase analysis [See, A. H. Reddi et al„, 
Proc. Na tl Acad Sci. , 69:1601 (1972)]. 

!0 The other half of each implant is fixed and processed for 

histological analysis. l^m glycolmethacrylate sections are 
stained with Von Kossa and acid fuschin to score the amount of 
induced bone and cartilage formation present in each implant. 
The terms +1 through +5 represent the area of each 

15 histological section of an implant occupied by new bone and/or 
cartilage cells and matrix. A score of +5 indicates that 
greater than 50% of the implant is new bone and/or cartilage 
produced as a direct result of protein in the implant. A score 
of +4, +3 r +2 and +1 would indicate that greater than 40% , 

20 30% r 20% and 10% respectively of. the implant contains new 
cartilage and/or bone. In a modified scoring method, three 
non-adjacent sections are evaluated from each implant and 
averaged. «+/-« indicates tentative identification of 
cartilage or bone; "+l H indicates >10% of each section being 

25 new cartilage or bone; "+2", >25%; "+3", >50%; "+4", -75%; 
"+5", >80%. A indicates that the implant is not recovered. 

It is contemplated that the dose response nature of the 
BMP-9 containing samples of the matrix samples will demonstrate 
that the amount of bone and/or cartilage formed increases with 

3 0 the amount of BMP-9 in the sample. It is contemplated that the 
control samples will not result in any bone and/or cartilage 
formation. 

As with other cartilage and/or bone inductive proteins 
such as the above-mentioned "BMP" proteins, the bone and/or 
35 cartilage formed is expected to be physically confined to the 
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space occupied by the matrix. Samples are also analyzed by SDS 
gel electrophoresis and isoelectric focusing followed by 
autoradiography- The activity is correlated with the protein 
bands and pi. To estimate the purity of the protein in a 
5 particular fraction an extinction coefficient of 1 OD/mg-cm is 
used as an estimate for protein and the protein is run on SDS 
PAGE followed by silver staining or radioiodination and 
autoradiography . 

10 EXAMPLE IV 

Expression of BMP-9 

In order to produce murine r human or other mammalian BMP— 9 
proteins, the DNA encoding it is transferred into an 
appropriate expression vector and introduced into mammalian 

15 cells or other preferred eukaryotic or prokaryotic hosts by 
conventional genetic engineering techniques- The preferred 
expression system for biologically active recombinant human 
BMP-9 is contemplated to be stably transformed mammalian cells. 
One skilled in the art can construct mammalian expression 

20 vectors by employing the sequence of Figure 1 (SEQ ID NO: 1) or 
Figure 3 (SEQ ID NO: 8) , or other DNA sequences encoding BMP-9 
proteins or other modified sequences and known vectors, such as 
pCD fOkayama et al., Mol. Cell piol. , 2:161-170 (1982)], pJL3, 
pjX4 [Gough et al. , EMBO J\, 4:645-653 (1985)] and pMT2 CXM. 

25 The mammalian expression vector pMT2 CXM is a derivative 

of p91023 (b) (Wong et al. , Science 228:810-815, 1985) 
differing from the latter in that it contains the ampicillin 
resistance gene in place of the tetracycline resistance gene 
and further contains a Xhol site for insertion of cDNA clones, 

30 The functional elements of pMT2 CXM have been described 
(Kaufman, R.J* , 1985, Proc. Natl- Acad. Sci. USA 82:689-693) 
and include the adenovirus VA genes, the SV40 origin of 
replication including the 72 bp enhancer, the adenovirus major 
late promoter including a 5' splice site and the majority of 
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the adenovirus tripartite leader sequence present on adenovirus 
late mRNAs, a 3' splice acceptor site p a OHFR insert, the SV40 
early polyadenylation site (SV40) , and pBR322 sequences needed 
for propagation in JLs. cjgJJL. 
5 Plasmid pMT2 CXM is obtained by EcoRI digestion of pMT2- 

VWF, which has been deposited with the American Type Culture 
Collection (ATCC) , Rockville, MD (USA) under accession number 
ATCC 67122. EcoRI digestion excises the cDNA insert present in 
PMT2-VWF, yielding pMT2 in linear form which can be ligated and 

10 used to transform E. coli HB 101 or DH-5 to ampicillin 
resistance. Plasmid pMT2 DNA can be prepared by conventional 
methods, pMT2 CXM is then constructed using loopout/in 
mutagenesis [Morinaga, et al« , Biotechnology 84 ; 636 (1984). 
This removes bases 1075 to 1145 relative to the Hind III site 

15 near the SV40 origin of replication and enhancer sequences of 
pMT2. In addition it inserts the following sequence: 

5' PO-CATGGGCAGCTCGAG-3 ' (SEQ ID NO: 5) 
at nucleotide 1145. This sequence contains the recognition 
site for the restriction endonuclease Xho I« A derivative of 

20 pMT2CXM, termed pMT23, contains recognition sites for the 
restriction endonucleases Fstl, Eco RI, Sail and Xhol. Plasmid 
pMT2 CXM and pMT23 DNA may be prepared by conventional methods. 

pEMC2bl derived from pMT2l may also be suitable in 
practice of the invention. pMT21 is derived from pMT2 which is 

25 derived from pMT2-VWF. As described above EcoRI digestion 
excises the cDNA insert present in pMT-VWF, yielding pMT2 in 
linear form which can be ligated and used to transform JS^ Coli 
HB 101 or DH-5 to ampicillin resistance. Plasmid pMT2 DNA can 
be prepared by conventional methods. 

30 pMT21 is derived from pMT2 through the following two 

modifications. First, 76 bp of the 5' untranslated region of 
the DHFR cDNA including a stretch of 19 G residues from G/C 
tailing for cDNA cloning is deleted. In this process a Xhol 
site is inserted to obtain the following sequence immediately 

35 
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upstream from DHFR: 5' -C^GC^GCGAGCCTGM2£COTCGMCC&TCATG-3 ' 

PStI ECO RI XhOl 

{SEQ ID NO: 6) 

Second, a unique Clal site is introduced by digestion with 
5 EcoRV and Xbal, treatment with Klenow fragment of DNA 
polymerase I, and ligation to a Clal linker (CATCGATG) . This 
deletes a 250 bp segment from the adenovirus associated RNA 
(VAX) region but does not interfere with VAI RNA gene 
expression or function. pMT21 is digested with EcoRI and Xhol, 

10 and used to derive the vector pEMC2Bl. 

A portion of the EMCV leader is obtained from pKT2-ECATl 
[S.K. Jung, et al, J. Virol 63; 1651-1660 (1989)] by digestion 
with Eco RI and PstI, resulting in a 2752 bp fragment- This 
fragment is digested with TaqI yielding an Eco RI-TaqI fragment 

15 of 508 bp which is purified by electrophoresis on low melting 
agarose gel. A 68 bp adapter and its complementary strand are 
synthesized with a 5' TaqI protruding end and a 3' Xhol 
protruding end which has the following sequence: 

20 5 ' — fiSStGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTT 

TaqI 

GAAAAACACGATXGC-3 ' 

XhOl (SEQ ID NO: 7) 

This sequence matches the EMC virus leader sequence from 
nucleotide 763 to 827. It also changes the ATG at position 10 
within the EMC virus leader to an ATT and is followed by a Xhol 
site. A three way ligation of the pMT21 Eco Rl-Xhol fragment, 
30 the EMC virus EcoRI-Taql fragment, and the 68 bp 

oligonucleotide adapter Taql-Xhol adapter resulting in the 
vector pEMC201. 

This vector contains the SV40 origin of replication and 
35 enhancer, the adenovirus major late promoter, a cDNA copy of 
the majority of the adenovirus tripartite leader sequence, a 
small hybrid intervening sequence, an SV4 0 polyadenylation 
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signal and -the adenovirus VA I gene, DHFR and /?- lactamase 
markers and an EMC sequence, in appropriate relationships to 
direct the high level expression of the desired cDNA in 
mammalian cells. 

5 The construction of vectors may involve modification of 

the BMP-9 DNA sequences. For instance, BMP -9 cDNA can be 
modified by removing the non-coding nucleotides on the 5 r and 
3 r ends of the coding region. The deleted non-coding 
nucleotides may or may not be replaced by other sequences known 

10 to be beneficial for expression. These vectors are transformed 
into appropriate host cells for expression of BMP-9 proteins. 

one skilled in the art can manipulate the sequences of 
Figure 1 or Figure 3 (SEQ ID NO: 1 and 8) by eliminating or 
replacing the mammalian regulatory sequences flanking the 

15 coding sequence with bacterial sequences to create bacterial 
vectors for intracellular or extracellular expression by 
bacterial cells. For example, the coding sequences could be 
further manipulated (e.g. ligated to other known linkers or 
modified by deleting non-coding sequences therefrom or altering 

20 nucleotides therein by other known. techniques) . The modified 
BMP-9 coding sequence could then be inserted into a known 
bacterial vector using procedures such as described in T. 
Taniguchi et al., Proc. Natl Acad. Sci. USA , 77:5230-5233 
(1980). This exemplary bacterial vector could then be 

25 transformed into bacterial host cells and a BMP-9 protein 
expressed thereby. For a strategy for producing extracellular 
expression of BMP— 9 proteins in bacterial cells , see, e.g. 
European patent application EPA 177,343. 

Similar manipulations can be performed for the 

30 construction of an insect vector [See, e.g. procedures 
described in published European patent application 155,476] for 
expression in insect cells. A yeast vector could also be 
constructed employing yeast regulatory sequences for 
intracellular or extracellular expression of the factors of the 

3 5 present invention by yeast cells. [See, e.g., procedures 
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described in published PCT application WO86/ 00639 and European 
patent application EPA 123,289]. 

A method for producing high levels of a BMP-9 protein of 
the invention in mammalian cells may involve the construction 
5 of cells containing multiple copies of the heterologous BMP-9 
gene. The heterologous gene is linked to an amplifiable 
marker, e.g. the dihydrof olate reductase (DHFR) gene for which 
cells containing increased gene copies can be selected for 
propagation in increasing concentrations of methotrexate {MTX) 

10 according to the procedures of Kaufman and Sharp, i — WQl* 
Biol, , 159:601-629 (1982) . This approach can be employed with 
a number of different cell types. 

" For example, a plasmid containing a DNA sequence for a 
BMP-9 of the invention in operative association with other 

IS plasmid sequences enabling expression thereof and the DHFR 
expression plasmid pAdA26SV(A)3 [Kaufman and Sharp, Jftol. cell. 
Bj.pl. . 2:1304 (1982)] can be co-introduced into DHFR— def i cient 
CHO cells, DUKX-BXI, by various methods including calcium 
phosphate coprecipitation and transfection, electroporation or 

20 protoplast fusion. DHFR expressing transformants are selected 
for growth in alpha media with dialyzed fetal calf serum, and 
subsequently selected for amplification by growth in increasing 
concentrations of MTX (e.g. sequential steps in 0.O2, 0.2 , 1.0 
and SUM MTX) as described in Kaufman et al. , jffgj. Cell, fijol. , 

25 5:1750 (1983). Transformants are cloned, and biologically 
active BMP— 9 expression is monitored by the Rosen-modified 
Sampath - Reddi rat bone formation assay described above in 
Example III. BMP-9 expression should increase with increasing 
levels of MTX resistance. BMP— 9 polypeptides are characterized 

30 using standard techniques known in the art such as pulse 
labeling with [35SJ methionine or cysteine and poly acryl amide 
gel electrophoresis. Similar procedures can be followed to 
produce other related BMP-9 proteins. 

35 
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A. BMP- 9 Vector Con struct: ion 

In order to produce human BMP -9 proteins of the invention 
DNA sequences encoding the mature region of the human BMP-9 
protein may be joined to DNA sequences encoding the propeptide 
5 region of the murine BMP-9 protein. This murine/human hybrid 
DNA sequence is inserted into an appropriate expression vector 
and introduced into mammalian cells or other preferred 
eukaryotic or prokaryotic hosts by conventional genetic 
engineering techniques. The construction of this murine/human 

10 BMP-9 containing expression plasmid is described below* 

A derivative of the human BMP-9 sequence (SEQ ID NO: 8) 
comprising the nucleotide sequence from nucleotide #105 to #470 
is specifically amplified- The following oligonucleotides are 
utilized as primers to allow the amplification of nucleotides 

15 #105 to #470 of the human BMP-9 sequence (SEQ ID NO: 8) from 
clone pGEM-111 described above. 

#3 ATCGG6CCCCTTTTAGCCAGGCGGAAAA6GAG 
#4 AGCGAATTCCCCGCAGGCAGATACTACCTG 
This procedure generates the insertion of the nucleotide 

20 sequence ATCGGGCCCCT immediately proceeding nucleotide #105 and 
the insertion of the nucleotide sequence GAATTCGCT immediately 
following nucleotide #470. The addition of these sequences 
results in the creation of an Apa I and EcoR I restriction 
endonuclease site at the respective ends of the specifically 

25 amplified DNA fragment. The resulting 374 bp Apa X/EcoR I 
fragment is subcloned into the plasmid vector pGEM-7Zf(+) 
CPromega catalog* p2251) which has been digested with Apa I and 
EcoR I. The resulting clone is designated phBMP9mex-l. 

30 The following oligonucleotides are designed on the basis 

of murine BMP— 9 sequences (SEQ ID N0:1) and are modified to 
facilitate the construction of the murine/human expression 
plasmid referred to above: 
#5 

35 GATTCCGTCGACCACCATGTCCCCTGGGGCCTGGTCTAGATGGATACACAGCTGTGGGGCC 
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10 



#6 CCACAGCTGTGTATCCATCTAGACCAGGCCCCAGGGGACATGGTGGTCGACG 
These oligonucleotides contain complimentary sequences which 
upon addition to each other facilitate the annealing (base 
pairing) of the two individual sequences, resulting in the 
formation of a double stranded synthetic DNA linker (designated 
LINK-1) in a manner indicated below: 

1 5 10 20 30 40 50 60 




This DNA linker (LINK-1) contains recognition sequences of 
restriction endonucleases needed to facilitate subsequent 
manipulations required to construct the murine/human expression 
plasmid, as well as sequences required for maximal expression 

15 of heterologous sequences in mammalian cell expression systems. 
More specifically (referring to the • sequence numbering of 
oligonucleotide #S/u:NK-l) : nucleotides comprise 
recognition sequences for the restriction endonucleases BamH 1 
and Sal I, nucleotides #11-#15 allow for maximal expression of 

20 heterologuos sequences in mammallian cell expression systems, 
nucleotides #16-#31 correspond to nucleotides #610-#625 of the 
murine BMP-9 sequence (SEQ ID K0:1) , nucleotides #32-#33 are 
inserted to facilitate efficient restriction digestion of two 
adjacent restriction endonuclease sites (Eco0109 I and Xba I) , 

25 nucleotides #34-#60 correspond to nucleotides #1S15-#1541 of 
the murine BMP— 9 sequence (SEQ ID NO:l) except that nucleotide 
#58 of synthetic oligonucloetide #5 is a G rather than the A 
which appears at position #1539 of SEQ ID NO : 1 (This nucleotide 
conversion results in the creation of an Apa I restriction 
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endonuclease recognition sequence , without altering the amino 
acid sequence it is intended to encode, to facilitate further 
manipulations of the murine/human hybrid expression plasmid. 
LiNK-l (the double stranded product of the annealing of 
5 oligonucleotides #5 and #6) is subcloned into the plasmid 
vector pGEM-7Zf (+) which has been digested with the restriction 
endonucleases Apa I and BamH I. This results in a plasmid in 
which the sequences normally present between the Apa I and BamH 
I sites of the pGEM-7Zf (+) plasmid polylinker are replaced with 

10 the sequences of LINK-1 described above. The resulting plasmid 
clone is designated pBMP-9link. 

pBMP-91ink is digested with the restriction endonucleases 
BamH I and Xba I resulting in the removal nucleotides #l-#34 of 
(refer to the numbering of oligo #5). Clone ML14a r 

15 which contains an insert comprising the sequence set forth in 
SEQ ID NO:l, is also digested with the restriction 
endonucleases BamH I and Xba I resulting in the removal of 
sequences comprising nucloetides #1-#1515 of SEQUENCE ID NO:l 
(murine BMP-9). This BamH I/Xba I fragment of mouse BMP-9 is 

20 isolated from the remainder of the KL14a plasmid clone and 
subcloned into the BamH I/Xba I sites generated by the removal 
of the synthetic linker sequences described above. The 
resulting clone is designated p302. 

The p302 clone is digested with the restriction 

25 endonuclease Eco0109 I resulting in the excision of nucloetides 
corresponding to nucleotides #621-#1515 of the murine BMP-9 
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sequence (SEQ ID NO:l) and nucleotides #35-#59 of LINK-1 (refer 
to numbering of oligonucleotide #5) • It should be noted that 
the Apa I restriction site created in LINK-l by the A to G 
conversion described above is a subset of the recognition 
5 sequence of EcoO109 I f therefore digestion of p302 with EcoO109 
I cleaves at the Apa I site as well as the naturally occuring 
murine Eco0109 I (location #619-#625 of SEQ ID NO:l) resulting 
in the excision of a 920 bp Eco0109 I/EcoO109 I (Apa I) 
fragment comprising the sequences described above. This 920 

10 Eco0109 I/Eco0109 I (Apa I) fragment is isolated from the 
remainder of the p302 plasmid clone and subcloned into clone 
pBMP-91ink which has been similarly digested with EcoO109 I. 
It should be noted that the nucleotides GG (#32-#33 of 
oligonucleotide #5) originally designed to facilitate a more 

IS complete digestion of the two adjacent restriction sites 
ECQO109 I and Xba I of LINK-1, which is now a part of pBKP- 
91ink (described above) , results in the creation of Dcm 
methylation recognition sequence. The restriction nuclease 
ECO0109 I is sensitive to Dcm methylation and therefore 

20 cleavage of this sequence (nucleotides #25-#3l of 
oligonucleotide #5/LINK-l) by the restriction endonuclease 
EC00109 I is prevented at this site. Therefore the plasmid 
clone pBMP-91ink is cleaved at the Apa X site but not at the 
Eco0109 I site upon digestion with the restriction endonuclease 

25 EcoO109 I as described above, preventing the intended* removal 
of the sequences between the EcoO!09 I and Xba I site of LINK-1 
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(#32-#55 defined by the numbering of oligonucleotide #5) • This 
results in the insertion of the 920 bp EcoO109 I/Apa I fragment 
at the Eco0109 I (Apa I) site of pBMP-91ink. The resulting 
clone is designated p3l8. 
5 Clone p318 is digested with the restriction endonucleases 

Sal X and Apa I, resulting in the excision of sequences 
comprising nucleotides #6-#56 of LINK-1 (refer to oligo #5 for 
location) , nucleotides #621-#1515 of murine BMP-9 (SEQ TO 
NO:l) , and nucleotides #35-#60 of LINK-1 (refer to oligo #5 for 

10 location) . The resulting 972 bp Sal I/Apa I fragment described 
above is isolated from the remainder of the p318 plasmid clone 
and will be utilized in subsequent manipulations. 

The clone phBMP9mex-l (described above) , which contains 
DNA sequences which encode the entire mature region and 

15 portions of the propeptide of the human BMP-9 protein, is 
digested with the restriction endonucleases Apa I and EcoR X. 
This results in the excision of a 374 bp fragment comprising 
nucleotides #105-#470 of the human BMP-9 sequence (SEQ ID NO:8) 
and the additional nucleotides of oligonucleotide primers #3 

20 and #4 which contain the recognition sequences for the 
restriction endonucleases Apa I and EcoR I. This 3 74 bp Apa 
I /EcoR I fragment is combined with the 972 bp Sal I/Apa I 
fragment from pi38 (isolation described above) and ligated to 
the mammalian cell expression plasmid pED6 (a derivative of 

25 pEMC201) which has been digested with Sal I and EcoR I. The 
resulting clone is designated p324. 
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The clone ML14a (murine BMP-9) is digested with EcoO109 I 
and Xba I to generate a fragment comprising nucleotides #621- 
#1515 of SEQ ID NO:l. 

The following oligonucleotides are synthesized on an 
5 automated DNA synthesizer and combined such that their 
complimentary sequences can base pair (anneal) with each other 
to generate a double stranded synthetic DNA linker designated 
LINK-2: 

#7 TCGACCACCATGTCCCCTGG 

ID #8 GCCCCAGGGGACATGGTGG 

This double stranded synthetic DNA linker (LINK-2) anneals in 

such a way that it generates single stranded ends which are 

compatible to DNA fragments digested with Sal I (one end) or 

Eco0109 I (the other end) as indicated below: 

15 #7 TCGACCACCATGTCCCCTGG 

GGTGGTACAGGGGACCCCG #8* 

This LINK-2 synthetic DNA linker is ligated to the 895 bp 

ECOO109 I/Xba I fragment comprising nucleotides #621-#1515 of 

murine BMP-9 (SEQ ID NO:l) described above. This results in a 

20 915 bp Sal I/Xba I fragment. 

The clone p324 is digested with Sal I/Xba I to remove 
sequences comprising nucleotides #6-#56 of LINK-1 (refer to 
oligo #5 for location) and nucleotides #62l-#1515 of murine 
BMP-9 (SEQ ID N0:1) . The sequences comprising nucleotides #35- 

25 #60 of LINK-l (refer to oligo #5 for location) and the 
sequences comprising the 374 bp Apa I/EcoR I fragment (human 
BMP-9 sequences) derived from phBMP9mex-l remain attached to 
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the pED6 backbone. The 915 hp Sal I/Xba I fragment: comprising 
LINK-2 sequences and nucleotides #621-#1515 of murine BMP-9 
(SEQ ID NO:l) is ligated into the p324 clone from which the Sal 
I to Xba I sequences described above have been removed. 
5 The resulting plasmid is designated BMP9fusion and 

comprises LINK-2, nucleotides #621-#1551 of murine BMP-9 (SEQ 
ID NO:l) , nucleotides #35-#59 of LINK-1 (refer to the numbering 
of oligonucleotide #5) , and the 374 bp Apa I/EcoR I fragment: 
{human BMP-9) derived from clone pBMP9mex-l (described above) 

10 inserted between the Sal I and EcoR I sites of the mammalian 
cell expression vector pED6. 

BMP 9 fusion is transfected into CHO cells using standard 
techniques known to those having ordinary skill in the art to 
create stable cell lines capable of expressing human BMP— 9 

15 protein- The cell lines are cultured under suitable culture 
conditions and the BMP-9 protein is isolated and purified from 
the culture medium. 

EXAMPLE V 

20 Biological Activity of Expressed BMP-9 

To measure the biological activity of the expressed BMP-9 
proteins obtained in Example IV above, the proteins are 
recovered from the cell culture and purified by isolating the 
BMP— 9 proteins from other proteinaceous materials with which 

25 they are co-produced as well as from other contaminants. The 
purified protein may be assayed in accordance with the rat bone 
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formation assay described in Example III. 

Purification is carried out using standard techniques 
known to those skilled in the art. It is contemplated, as with 
other BMP proteins, that purification may include the use of 
5 Heparin sepharose. 

Protein analysis is conducted using standard techniques 
such as SDS-PAGE acrylamide [U.K. Laemmli, Nature 227:680 
(1970) ] stained with silver [R.R- Oakley , et al. Ana?,, qiochem. 
105:361 {1980) 3 and by immunoblot [H. Towbin, et al. Proc. 
10 Natl. Acad. Sci. USA 76:4350 (1979)] 

The foregoing descriptions detail presently preferred 
embodiments of the present invention. Numerous modifications 
and variations in practice thereof are expected to occur to 
those skilled in the art upon consideration of these descrip- 
15 tions. Those modifications and variations are believed to be 
encompassed within the claims appended hereto. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: Wozney, John M. 

Celest^, Anthony 

(ii) TITLE OF INVENTION: BMP-9 COMPOSITIONS 
(iii) NUMBER OF SEQUENCES : 9 

(iv) CORRESPONDENCE ADDRESS: 

[A* ADDRESSEE: Genetics Institute, Inc. 

(B) STREET: Legal Affairs - 87 CaafcridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: US 

(F) ZIP: 02140 . 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
fB) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY /AGENT INFORMATION : 

(A) NAME: Kapinos, Ellen J. 

(B) REGISTRATION NUMBER: 32 , 245 

(C) REFERENCE/DOCKET NUMBER: GI 518 6A 

fix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 876-1170 

(B) TELEFAX: (617) 876-5851 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MUS zmsculUS 

(B) STRAIN: C57B46XCBA 
(F) TISSUE TYPE: liver 
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(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Mouse liver cDNA 

(B) CLONE: ML14A 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1564, .1893 

(ix) PERTURBS 

(A) NAME/KEY: COS 

CB) LOCATION: 610. .1896 

(ix) FEATURE: 

(A) NAME/ KEY: mRNA 

(B) LOCATION: 1..2447 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



CATTAATAAA 


TATTAAGTAT 


TGGAATTAGT 


GAAATTGGAG 


TTCCTTGTGG 


AAGGAAGTGG 


60 


GCAAGT6AGC 


TTTTTAGTTT 


GTGTCGGAAG 


CCTGTAATTA 


CGGCTCCAGC 


TCATAGTGGA 


120 


ATGGCTATAC 


TTAGATTTAT 


GGATAGTTGG 


GTAGTAGGTG 


TAAATGTATG 


TGGTAAAAGG 


180 


CCTAGGAGAT 


TTGTTGATCC 


AATAAATATG 


ATTAGGGAAA 


CAATTATTAG 


GGTTCATGTT 


240 


CGTCCTTTTG 


GTGTGTGGAT 


TAGCATTATT 


TGTTTGATAA 


TAAGTTTAAC 


TAGTCAGTGT 


300 


TG GAAAGAAT 


GGAGACGGTT 


GTTGATTAGG 


CGTTTTGAGG 


ATGGGAATAG 


GATTGAAGGA 


360 


AATATAATGA 


TGGCTACAAC 


GATTGGGAAT 


CCTATTATTG 


TTGGGGTAAT 


GAATGAGGCA 


420 


AATAGATTTT 


CGTTCATTTT 


AATTCTCAAG 


GGGTTTTTAC 


TTTTATGTTT 


GTTAGTGATA 


480 


TTGGTGAGTA 


GGCCAAGGGT 


TAATAGTGTA 


ATTGAATTAT 


AGTGAAATCA 


TATTACTAGA 


540 


CCTGATGTTA 


GAAGGAGGGC 


TGAAAAGGCT 


CCTTCCCTCC 


CAGGACAAAA 


CCGGAGCAGG 


600 


GCCACCCGG ATG TCC CCT 


GGG GCC TTC CGG GTG GCC CTG CTC CCG CTG 


648 


Met Ser Pro 


Gly Ala Pne Arg Val Ala Leu Leu Pro Leu 




•318 


-315 


-310 







TTC CTG CTG GTC TGT GTC ACA GAG CAG AAG CCG CTG CAG AAC TGG GAA 696 
Phe Leu Leu Val Cys Val Tnr Gin Gin Lys Pro Leu Gin Asn Trp Glu 
•305 -300 -295 -290 

CAA GCA TCC CCT GGG GAA AAT GCC CAC AGC TCC CTG GGA TTG TCT GGA 744 
Gin Ala Ser Pro Gly Glu Asn Ala His Ser Ser Leu Gly Leu Ser Gly 
-285 -280 -275 

GCT GGA GAG GAG GGT GTC TTT GAC CTG CAG ATG TTC CTG GAG AAC ATG 792 
Ala Gly Glu Glu Gly Val Pne Asp Leu Gin Met Pne Leu Glu Asn Met 
-270 -265 -260 

AAG GTG GAT TTC CTA CGC AGC CTT AAC CTC AGC GGC ATT CCC TCC CAG 840 
Lys Val Asp Phe Leu Arg Ser Leu Asn Leu Ser Gly lie Pro Ser Gin 
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(2) INFORMATION FOR SEQ ID NO* 9: 

(i) sequence characteristics: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

* Thr Arg Glu Cys ser Arg Ser Cys Pro Arg Thr Ala Pro Gin Arg 
-41 -40 -35 -30 

Gin Val Arg Ala Val Thr Arg Arg Thr Arg Met Ala His Val Ala Ala 
-25 ~ -20 -15 -10 

Gly Ser Thr Leu Ala Arg Arg Lys Arg Ser Ala Gly Ala Gly Ser His 
-5 1 5 

Cys Gin Lys Thr Ser Leu Arg Val Asn Phe Glu Asp lie Gly *rp Asp 
10 '15 20 

» . . * 

Ser Trp lie* lie Ala Pro Lys Glu Tyr Glu Ala Tyr Glu Cys Lys Gly 
25 30 35 

Gly Cys Phe Phe Pro Leu Ala Asp Asp Val Thr Pro Thr Lys His Ala 
40 45 50 55 

Xle Val Gin Thr Leu Val His Leu Lys Phe Pro Thr Lys Val Gly Lys 
60 65 70 

Ala cys Cys Val Pro Thr Lys Leu Ser Pro He Ser Val Leu Tyr Lys 
75 80 85 

Asp Asp Met Gly Val Pro Thr Leu Lys Tyr His Tyr Glu Gly Met Ser 
90 95 100 

Val Ala Glu cys Gly Cys Arg 
105 HO 
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fix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATIONS 1-.470 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1-.456 

fix) FEATURE: . 4> , rt 

(A) NAME/KEY: mat^peptide 

(B) LOCATION: X24„«.453 

(ix) FEATURE: . 

V ' (A) SAME/KEY: mRHA 

(B) LOCATION: 1^470 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

* Thr Arg Glu cys ser Arg Ser cys «:o ^ 

-41 -40 



48 
96 
144 
192 
240 
288 



-41 — *U 

SSiSSSiSiSSSSSKSSSSS 

-25 " 20 

-issgasgssssssss 

* r-rc ATC TCC GTC CTC TAC AAG 384 

75 

««« «P*r cm? TAC GAG GGC ATG AGC * J<t 

90 ^ 470 

GTG GCA GAG TGT GGG TGC AGG TAGIATCTGC CTGCGGG 
VaX La Glu cys Gly cys Arg 
ios ■ LJ - U 
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(2) INFORMATION FOR SEQ ID NO: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTGCAGGCGA GCCTGAATTC CTCGAGCCAT CATG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA *tO BRNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
OGAGGTTAAA AAACGTCTAG GCCCCCCGAA CCACGGGGAC GTGGTTTTCC TTTGAAAAAC 
ACGATTGC 

(2) INFORMATION FOR SEQ ID NO: 8: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: C-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(H) CELL LINE: W138 (genomic DNA) 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: human genomic library 

(B) CLONE: lambda 111-1 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 
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Arc lie Asn lie Tyr Gin Val Met Lys Pro Pro Ala Glu Val Val Pro 

-1X5 -HO *" 105 

Gly His Leu lie Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn 
-100 -95 " 90 

Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp 
-80 -75 -/0 

Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala lie Glu Val Tar His 



-65 

-50 



Leu His Gin Thr Arg Thr His Gin Gly Gin His Val Arg lie Ser Arg 



Ser Leu Pro Gin Gly ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu 
-35 -30 -25 

val Thr Phe Gly His Asp Gly Arg Gly Hie Ala Leu Thr Arg Arg Arg 

-20 -15 

Arg Ala Lys Arg ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys 

Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val 
15 20 25 

Gly Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr 
.30 35 40 

cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr 

Asn His Ala lie Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser Xle 
65 70 75 

Pro Lys Ala Cys cys Val Pro Thr Glu Leu Ser. Ala lie Ser Met Leu 
80 

Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met 



85 90 



95 



Val Val Glu Gly Cys Gly Cys Arg 
110 115 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES cDNA to mBNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5; 



SUBSTITUTE SHEET 



WO 93/00432 3? PCT/US92/05374 

TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 1666 
cys Gly cys Arg 
115 

CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 1726 

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 1786 

ATCCCTAAAC ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 1846 

TGATCATATA TTTTGACAAA ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG 1906 

AGTCATTATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 1954 

(2) INFORMATION FOR SEQ ID NO: 4 J 

(1) SEQUENCE CHARACTERISTICS : 

(A) I^ENGTHs 408 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met He Pro Gly Asn Arg Met Leu Met Val Val I*eu Leu cys Gin Val 
-292 -290 -285 -280 

lieu Leu Gly Gly Ala Ser His Ala ser Leu lie Pro Glu Thr Gly Lys 
-275 -270 -265 

Lys Lys Val Ala Glu lie Gin Gly His Ala Gly Gly Arg Arg Ser Gly 

-260 -255 "250 "245 

Gin Ser His Glu Leu Leu Arg Asp Phe Glu Ala, Thr Leu Leu Gin Met 
-240 -235 -230 

Phe Gly Leu Arg Arg Arg Pro Gin Pro Ser Lys Ser Ala Val He Pro 
-225 -220 -215 

Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu 
-210 -205 -200 

Glu Gin He His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser 
-195 -190 -185 

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn 
-180 -175 -170 ~ 165 

He Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu 
-160 -155 -150 

Ser Ser He Pro Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu 
-145 -140 -135 

Phe Arg Glu Gin Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His 
-130 -125 -120 
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GTG GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 942 

Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His Arg Xle Asn lie 
-125 -120 -115 

TAT GAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 990 
Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu Xle 
-110 ^105 -100 

ACA CGA CTA CTG GAC ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG 1038 
Thr Arg Len l«eu Asp Thr Arg l*eu Val His His Asn Val Thr Arg Trp 
-95 -90 -85 

GAA ACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 1086 
Glu Thr Fiie Asp Val Ser Pro Ala Val lieu Arg Trp Thr Arg Glu Lys 
-80 -75 -70 -65 

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 1134 
Gin Pro Asn Tyr Gly Leu Ala lie Glu Val Thr His Leu His Gin Thr 
-60 -55 -SO 

CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 1182 
Arg Thr His Gin Gly Gin His Val Arg He Ser Arg Ser Leu Pro Gin 
• -45 -40 -35 

GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC 6TG GTC ACC TTT GGC 1230 
Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu Val Thr Phe Gly 
-30 "25 -20 

CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 1278 
His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg 
-15 -10 -S 

AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 1326 
Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys 
1 5 10 I 5 

CGG CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT -GAC 1374 
Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp 
20 25 30 

TGG ATT GTG GCC CCA CCA GGC TAC CAG GCC TTC TAG TGC CAT GGG GAC 1422 
Trp He Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys Hxs Gly Asp 
35 40 45 

TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 1470 
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala lie 
50 55 60 

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 1518 
Val Gin Thr Leu Val Asn Ser Val Asn Ser ser He Pro Lys Ala Cys 
65 70 75 80 

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 1566 
Cys Val Pro Thr Glu Leu Ser Ala lie Ser Met Leu Tyr Leu Asp Glu 
85 90 95 

TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 1614 
5£ Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu Gly 

* 100 105 HO 
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(B) LOCATION: 9.. 1934 



(3d.) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA 60 

GGTGAGTGTG GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG 120 

AGTATCTAGC TTGTCTCCCC GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC 180 

ACAGTCCCCG GCCCTCGCCC AGGTTCACTG CAACCGTTCA GAGGTCCCCA GGAGCTGCTG 240 

CTGGCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC GTAGTGCCAT CCCGAGCAAC 300 

GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG CTGTCAAGAA 360 

TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT GGT 414 

Met lie Pro Gly 
-292 -290 

AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC 462 
Asn Arg Met Leu Met Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly 
-285 -280 -275 

GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 510 
Ala Ser His Ala Ser Leu lie Pro Glu Thr Gly Lys Lys Lys Val Ala 
-270 -265 -260 

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 558 
- Glu lie Gin Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu 
-255 -250 -245 

CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 606 
Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gin Met Phe Gly Leu Arg 
-240 -235 -230 -225 

CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CCG GAC TAC ATG CGG 654 
Arg Arg Pro Gin Pro Ser Lys Ser Ala Val lie Pro Asp Tyr Met Arg 
-220 ~ -215 -210 

GAT CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 702 
Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu Glu Gin lie His 
-205 -200 -195 

AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 750 
Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr 
-190 -185 -180 

GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 798 
Val Arg Ser Phe His His Glu Glu His Leu Glu Asn lie Pro Gly Thr 
-175 -170 -165 

AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 84 6 

Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser ser lie Pro 
-160 -155 -150 -145 

GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 89 4 

Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gin 

-140 -135 -130 
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-10 -5 1 

Gly Ala Ser Ser His Cys Gin Lys Thr Ser Leu Arg Val Asn Phe Glu 
5 ^ 10 15 

Asp lie Gly Trp Asp Ser Trp He lie Ala Pro Lys Glu Tyr Asp Ala 
20 25 30 

Tyr Glu cys Lys Gly Gly cys Phe Phe Pro Leu Ala Asp Asp Val Thr 
35 40 45 50 

Pro Thr Lys His Ala Xle Val Gin Thr Leu Val His Leu Glu Phe Pro 
55 60 65 

Thr Lys Val Gly Lys Ala Cys Cys Val Pro Thr Lys Leu Ser Pro lie 
70 75 80 

Ser lie Leu Tyr Lys Asp Asp Met Gly Val Pro Thr Leu Lys Tyr His 
85 ^ 90 95 

Tyr Glu Gly Met Ser Val Ala Giu Cys Gly Cys Arg 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS l 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to fflRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(G) CELL TYPE: osteosarcoma Cell Line 

(H) CELL LINE: U-20S 

fvii) IMMEDIATE SOURCE: 

(A) LIBRARY: U20S cDNA in Lambda gtlO 

(B) CLONE: Lambda U20S-3 

(viii) POSITION IN GENOME: 

(C) UNITS I bp 

{ ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 403.. 1629 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1279.. 1626 

(ix) FEATURE: 

(A) NAME/ KEY: mENA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Pro Gly Ala Phe Arg Val Ala Leu Leu Pro Leu Phe Leu Leu 
-318 -315 -310 " 305 

val Cys Val Thr Gin Gin Lye Pro Leu Gin Asn Trp Glu Gin Ala Ser 
-300 -295 -2SO 

Pro Gly Glu Asn Ala His Ser Ser Leu Gly Leu SerJSly Ala Gly Glu 



-285 



-280 -275 



Glu Gly val Phe Asp Leu Gin Met Phe Leu Glu Asn Met Lys Val Asp 

-270 "265 -260 -255 



Phe 



Leu Arg Ser Leu Asn Leu Ser Gly lie Pro Ser Gin Asp Lys Thr 
- —245 — 



-250 



Arg Ala Glu Pro Pro Gin Tyr Met lie Asp Leu Tyr Asn Arg^Tyr Thr 
-235 «*230 -ZZ» 

Thr Asp Lys Ser Ser Thr Pro Ala Ser Asn He Val Arg Ser Phe Ser 
-220 -215 -Z1U 

Val Glu Asp Ala He Ser Thr Ala Ala Thr Glu Asp Phe Pro Phe Gin 
-205 "200 -195 

Lys His He Leu He Phe Asn He Ser He Pro Arg His Glu Gin lie 
-190 -185 -180 i/S> 

Thr Arg Ala Glu Leu Arg Leu Tyr Val Ser cys Gin Asn Asp val Asp 
-170 -165 ~ 16 ° 

Ser Thr His Gly Leu Glu Gly Ser Met Val Val Tyr Asp Val Leu Glu 
-155 -150 -XM 

Asp ser Glu Thr Trp Asp Gin Ala Thr Gly Thr Lys Thr Phe Leu Val 
-140 -135 -130 

Ser Gin Asp He Arg Asp Glu Gly Trp Glu Thr Leu Glu Val Ser Ser 
-125 -120 "US 

Ala val Lys Arg Trp Val Arg Ala Asp Ser Thr Thr Asn Lys Asn Lys 
-110 -105 -100 5,3 

Leu Glu Val Thr Val Gin Ser His Arg Glu Ser Cys Asp Thr Leu Asp 

-85 -*»" 



-90 



lie Ser Val Pro Pro Gly Ser Lys Asn Leu Pro Phe Phe Val Val Phe 
-75 -70 -° 5 

Ser Asn Asp Arg Ser Asn Gly Thr Lys Glu Thr Arg Leu Glu Leu Lys 
-60 -55 " 5 ° 

Glu Met He Gly His Glu Gin Glu Thr Met Leu Val Lys Thr Ala Lys 
-45 -40 ~ 35 

Asn Ala Tyr Gin Val Ala Gly Glu Ser Gin Glu Glu Glu Gly Leu Asp 
-30 -25 "2° 

Gly Tyr Thr Ala Val Gly Pro Leu Leu Ala Arg Arg Lys Arg Ser Thr 
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-IS -10 -5 

AGS AGC ACC GGA GCC AGC AGC CAC TGC CAG AAG ACT TCT CTC AGG GTC 1608 
Arg ser Thr Gly Ala Ser Ser His Cys Gin Lys Thr ser Leu Arg Val 
X 5 

AAC ITT GAG GAC ATC GGC TGG GAC AGC TGG ATC ATT GCA CCC AAG GAA 1656 
Asn Piie Glu Asp He Gly Txp Asp Ser Trp He He Ala Pro lys Glu 
20 25 30 

•CAT GAC GCC TAT GAG TGT AAA GGG GGT TGC TTC TTC CCA TTG GCT GAT 1704 

Tyr Asp Ala Tyr Glu Cys Lys Gly Gly Cys Phe Phe Pro Leu Ala Asp 

— 40 *5 



35 



GAC GTG ACA CCC ACC AAA CAT GCC ATC GTG CAG ACC CTC GTC CAT CTC 1752 
Asp Val Thr Pro Thr Lys His Ala He Val Gin Thr Leu Val His Leu 
50 55 60 

GAG TTC CCC ACA AAG GTG GGC AAA GCC TGC TGC GTT CCC ACC AAA CTG 1800 
Glu Phe Pro Thr Lys Val Gly Lys Ala Cys Cys Val Pro Thr Lys Leu 
65 70 75 

AGT CCC ATC TCC ATC CTC TAC AAG GAT GAC ATC GGG GTC CCA ACC CTC 1848 
Ser Pro He Ser He Leu Tyr Lys Asp Asp Met Gly Val Pro Thr Leu 
80 85 90 95 

AAG TAC CAC TAT GAG GGG ATC AGT GTG GCT GAG TGT GGG TCT AGG TAGTCCCTGC 1903 
Lys Tyr His Tyr Glu Gly Met Ser Val Ala Glu Cys Gly Cys Arg 

AGCCACCCAG GGTGGGGATA CAGGACATGG AAGAGGTTCT GGTACGGTCC TCCATCCTCC 1963 

TGCGCATCGT ATCCCTAAGT TGATCAGAAA CCATCCTTGA GAAGAAAAGG AGTTAGTTGC 2023 

CCTTCTTCTC TCTGGTGGGT CCCTCTCCTC AAGTGACAAT GACTGGGGTA TCCGGGCCTC 2083 

TGGGCAGAGC AGGAGACCCT GGAAGGGTTA GTGGGTAGAA AGATGTCAAA AAGGAAGCTC 2143 

TGGGTAGATG ACCTGCACTC CAGTGATTAG AAGTCCAGCC TTACCTGTCA GAGAGCTCCT 2203 

GGCATCTAAG AGAACTCTGC TTCCTCATCA TCCCCACCGA CTTGTTCTTC CTTGGGAGTG 2263 

TCTCCTCAGG GAGAACAGCA TTGCTCTTCC TCTGCCTCAA GCTCCCAGCT GACTCTCCTG 2323 

TGGCTCATAG GACTGAATGG GGTGAGGAAG AGCCTGATGC CCTCTGGCAA TCAGAGCCCG 2383 

AAGGACTTCA AAACATCTGG ACAACTCTCA TTGACTCATC CTCCAACATA ATTTTTAAAA 2443 

2447 

AGAG 



(2) IMFORMATIOH FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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GAC AAA ACC AGA GCG GAG CCA CCC GAG TAG ATG ATC GAC TTG TAC AAC 888 
Asp Lys Thr Arg Ala Glu Pro Pro Gin Tyr Met lie Asp Leu Tyr Asn 

-240 -235 -230 

AGA TAC ACA ACG GAC AAA TCG TCT ACG CCT GCC TCC AAC ATC GTG CGG 936 
Arg Tyr Thr Thr Asp Lys Ser Ser Thr Pro Ala Ser Asn lie Val Arg 
-225 -220 -215 -210 

AGC TTC AGC GTG GAA GAT GCT ATA TCG ACA GCT GCC ACG GAG GAC TTC 984 
Ser Phe Ser val Glu Asp Ala lie ser Thr Ala Ala Thr Glu Asp Phe 

-205 -200 -195 

CCC TTT CAG AAG CAC ATC CTG ATC TTC AAC ATC TCC ATC CCG AGG CAC 1032 
Pro Phe Gin Lys His He Leu lie Phe Asn Xle Ser lie Pro Arg His 
-190 -185 -180 

GAG CAG ATC ACC AGG GCT GAG CTC CGA CTC TAT GTC TCC TGC CAA AAT 1080 
Glu Gin He Thr Arg Ala Glu Leu Arg Leu Tyr Val Ser Cys Gin Asn 
-175 -170 -165 

GAT GTG GAC TCC ACT CAT GGG CTG GAA GGA AGC ATG GTC GTT TAT GAT 1128 
Asp Val Asp Ser Thr His Gly Leu Glu Gly Ser Met Val Val Tyr Asp 
-160 -155 -150 

GTT CTG GAG GAC AGT GAG ACT TGG GAC CAG GCC ACG GGG ACC AAG ACC 1176 
Val Leu Glu Asp Ser Glu Thr Trp Asp Gin Ala Thr Gly Thr Lys Thr 
-145 -140 -135 -13 0 

TTC TTG GTA TCC CAG GAC ATT CGG GAC GAA GGA TGG GAG ACT TTA GAA 1224 
Phe Leu Val Ser Gin Asp He Arg Asp Glu Gly Trp Glu Thr Leu Glu 
-125 -120 -115 

GTA TCG AGT GCC GTG AAG CGG TGG GTC AGG GGA GAC TCC ACA ACA AAC 1272 
Val Ser Ser Ala Val Lys Arg Trp Val Arg Ala Asp Ser Thr Thr Asn 
-110 -105 -100 

AAA AAT AAG CTC GAG GTG ACA GTG CAG AGC CAC AGG GAG AGC TGT GAC 1320 
Lys Asn Lys Leu Glu Val Thr Val Gin Ser His Arg Glu Ser Cys Asp 
-95 -90 -85 

ACA CTG GAC ATC AGT GTC CCT CCA GGT TCC AAA AAC CTG CCC TTC TTT 13 68 

Thr Leu Asp lie Ser Val Pro Pro Gly Ser Lys Asn Leu Pro Phe Phe 
-80 -75 -70 

GTT GTC TTC TCC AAT GAC CGC AGC AAT GGG ACC AAG GAG ACC AGA CTG 1416 
Val Val Phe Ser Asn Asp Arg Ser Asn Gly Thr Lys Glu Thr Arg Leu 
-65 "60 -55 -50 

GAG CTG AAG GAG ATG ATC GGC CAT GAG CAG GAG ACC ATG CTT GTG AAG 1464 
Glu Leu Lys Glu Met He Gly His Glu Gin Glu Thr Met Leu Val Lys 
-45 -40 -35 

ACA GCC AAA AAT GCT TAC CAG GTG GCA GGT GAG AGC CAA GAG GAG GAG 1512 
Thr Ala Lys Asn Ala Tyr Gin Val Ala Gly Glu Ser Gin Glu Glu Glu 
-30 -25 -20 

GGT CTA GAT GGA TAC ACA GCT GTG GGA CCA CTT TTA GCT AGA AGG AAG 1560 
Gly Leu Asp Gly Tyr Thr Ala Val Gly Pro Leu Leu Ala Arg Arg Lys 
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What is claimed is: 



1. A BMP-9 polypeptide comprising the amino acid sequence from 
amino acid #8 - 110 as set forth in FIG. 3 (SEQ ID NO: 9) . 

2. A BMP— 9 polypeptide comprising the amino acid sequence from 
amino acid #1 - 110 as set forth in FIG. 3 (SEQ ID NO: 9) . 

3. A BMP-9 polypeptide of claim 1 wherein said polypeptide is 
a diner wherein each subunit comprises at least the amino acid 
sequence from amino acid #8 - 110 of FIG. 3 (SEQ ID NO: 9) . 

4. A BMP-9 polypeptide of claim 2 wherein said polypeptide is 
a dimer wherein each subunit comprises at least the amino acid 
sequence from amino acid #1-110 of FIG. 3. (SEQ ID NO: 9). 

5. A purified BMP-9 protein produced by the steps of 

(a) culturing a cell transformed with a cDNA comprising 
the nucleotide sequence from nucleotide #124 to #453 as shown 
in FIG. 3 (SEQ ID NO: 8) ; and 

(b) recovering and purifying from said culture medium a 
protein comprising the amino acid sequence from amino acid #1 
to amino acid #110 as shown in FIG. 3 (SEQ ID NO: 9) . 

6. A purified BMP-9 protein produced by the steps of 

(a) culturing a cell transformed with a cDNA comprising 
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the nucleotide sequence from nucleotide #124 to #453 as shown 
in FIG. 3 (SEQ ID NO: 8); and 

(b) recovering form said culture medium a protein 
comprising an amino acid sequence from amino acid #8 to amino 
acid #110 as shown in Figure 3 (SEQ ID NO: 9) . 

7. A BMP-9 protein characterized by the ability to induce the 
formation of cartilage and/ or bone* 

8. A DNA sequence encoding a BMP-9 protein. 

9. The DNA sequence of claim 8 wherein said DNA comprises 

(a) nucleotide 124 to 453 (SEQ ID NO: 8) ; and 

(b) sequences which hybridize thereto under stringent 
hybridization conditions and exhibit the ability to form 
cartilage and/or bone. 

10. The DNA sequence of claim 8 wherein said DNA comprises 

(a) nucleotide 145 to 453 (SEQ ID NO: 8); and 

(b) sequences which hybridize thereto under stringent 
hybridization conditions and exhibit the ability to form 
cartilage and/ or bone. 

11. A host cell transformed with a DNA sequence encoding BMP- 
8. 
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12. A method for producing a purified BMP-9 protein said 
method comprising the steps of 

(a) culturing a cell transformed with a cDNA comprising 
the nucleotide sequence encoding a BMP-9 protein; and 

(b) recovering and purifying said BMP-9 protein from the 
culture medium. 

13. A pharmaceutical composition comprising an effective 
amount of a BMP-9 protein in admixture with a pharmaceutically 
acceptable vehicle. 

14. A composition of claim 13 further comprising a matrix for 
supporting said composition and providing a surface for bone 
and/or cartilage growth. 

15. The composition of claim 14 wherein said matrix comprises 
a material selected from the group consisting of 
hydroxyapatite r collagen, polylactic acid and tricalcium 
phosphate - 

16. A method for inducing bone and/ or cartilage formation in a 
patient in need of same comprising administering to said 
patient an effective amount of the composition of claim 13. 

17. A pharmaceutical composition for wound healing and tissue 
repair said composition comprising an effective amount of the 



SUBSTITUTE SHEET 



WO 93/00432 



PCT/US92/05374 



47 

protein of a BMP- 9 protein in a pharmaceutical^ acceptable 
vehicle. 



18. A method for treating wounds and/or tissue repair in a 
patient in need of same comprising administering to said 
patient an effective amount of the composition of claim 17. 
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Figure 1A 

10 20 30 40 50 60 70 

CATTAATAAA TATTAAGTAT TGGAATTAGT GAAATTGGAG TTCCTTGTGG AAGGAAGTGG GCAAGTGAGC 

80 90 100 110 120 130 140 

TTTTTAGTTT GTGTCGGAAG CCTGTAATTA CGGCTCCAGC TCATAGTGGA ATGGCTATAC TTAGATTTAT 

150 160 170 180 190 200 210 

GGATAGTTGG GTAGTAGGTG TAAATGTATG TGGTAAAAGG CCTAGGAGAT TTGTTGATCC AATAAATATG 

220 230 240 250 260 270 280 

ATTAGGGAAA CAATTATTAG GGTTCATGTT CGTCCTTTTG GTGTGTGGAT TAGCATTATT TGTTTGATAA 

290 300 310 320 330 340 350 

TAAGTTTAAC TAGTCAGTGT TGGAAAGAAT GGAGACGGTT GTTGATTAGG CGTTTTGAGG ATGGGAATAG 

360 370 380 390 400 410 420 

GATTGAAGGA AATATAATGA TGGCTACAAC GATTGGGAAT CCTATTATTG TTGGGGTAAT GAATGAGGCA 

430 440 450 460 470 480 490 

AATAGATTTT CGTTCATTTT AATTCTCAAG GGGTTTTTAC TTTTATGTTT GTTAGTGATA TTGGTGAGTA 

500 510 520 530 540 550 560 

GGCCAAGGGT TAATAGTGTA ATTGAATTAT AGTGAAATCA TATTACTAGA CCTGATGTTA GAAGGAGGGC 
570 580 590 600 609 618 

> ^ | 

TGAAAAGGCT CCTTCCCTCC CAGGACAAAA CCGGAGCAGG GCCACCCGG ATG TCC CCT GGG 

M S P G 

627 636 645 654 663 672 

GCC TTC CGG GTG GCC CTG CTC CCG CTG TTC CTG CTG GTC TGT GTC ACA CAG CAG 
AFRVAIiLPLFLLVCVTQQ 

681 690 699 708 717 726 

AAG CCG CTG CAG AAC TGG GAA CAA GCA TCC CCT GGG GAA AAT GCC CAC AGC TCC 
KPLQNWEQASPGENAHSS 

735 744 753 762 771 780 

CTG GGA TTG TCT GGA GCT GGA GAG GAG GGT GTC TTT GAC CTG CAG ATG TTC CTG 
LGLSGAGEEGVFDLQMFL 

789 798 807 816 825 834 

GAG AAC ATG AAG GTG GAT TTC CTA CGC AGC CTT AAC CTC AGC GGC ATT CCC TCC 
ENMKVDPLRSLNLSGIPS 
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843 852 861 870 879 888 

CAG GAC AAA ACC AGA GCG GAG CCA CCC CAG TAC ATG ATC GAC TTG TAC AAC AGA 
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Figure 1C 

1383 1392 1401 1410 1419 1428 

TCC^^^^^GsgSgg^g^^^^ — — -— — 

oNDRSNGTKETRLEIiKEM 

1437 1446 "55 1464 1473 1482 

ATC GGC CM ATG CTT GTG AAG ACA GCC AAA AAT GCT TAC CAG 

IGHEQETMLVKTAKNAYO 

X491 1500 1509 1518 1527 1536 

VAGESQEEEGL .DG-YTAVG 
1545 1554 i 56 3 1572 1S81 1S9Q 

^KRSTGASSHCQK 

1 ™_ 1617(319> 1626 1635 (326) " 44 

XSLR.VNP ED.IGWD SWI I A 

1 ^ 1662 1671 1680 1689 1698 

CCC ^AG GAA ^AT CAC GCC TAT GAG TGT AAA GGG GGT TGC TTC TTC CCA TTG GCT 
fKEyDAYECKGGCPFPLA 
1707 1716 1725 1734 1743 1752 

GAT GAC GTG ACA CCC ACC AAA CAT GCC ATC GTG CAG ACC CTG GTG CAT CTC GAG 
O DVT P TK HA IVQT L V H L E 

1761 1770 1779 1788 17g7 18()6 

TTC CCC ACA AAG GTG GGC AAA GCC TGC TGC GTT CCC ACC AAA CTG ACT CCC ATC 
* F * KVG KACCVPTKLSPI 
1815 1824 1833 1842 1851 18g0 

TCC ATC CTC TAC GAC ATG GGG GTG CCA ACC CTC AAG TAC CAC TAT GAG 

*■ Xti KD DMGVPTLKYHYE 

1869 1878 1887 1903 1913 1923 

GGG ATG AGT GTG GCT GAG TGT GGG TGT AGG TAGTCCCTGC AGCCACCCAG GGTGGGGATA 
GMS. AECGCR 

(428) 
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Figure ID 

1933 1943 19S3 1963 1973 1983 1993 

CAGGACATGG AAGAGGTTCT GGTACGGTCC TGCATCCTCC TGCGCATGGT ATGCCTAAGT TGATCAGAAA 

2003 2013 2023 2033 2043 2053 2063 

CCATCCTTGA GAAGAAAAGG AGTTAGTTGC CCTTCTTGTG TCTGGTGGGT CCCTCTGCTG AAGTGACAAT 

2073 2083 2093 2103 2113 2123 2133 

GACTGGGGTA TGCGGGCCTG TGGGCAGAGC AGGAGACCCT GGAAGGGTTA GTGGGTAGAA AGATGTCAAA 

2143 2153 2163 2173 2183 2193 2203 

AAGGAAGCTG TG G GTAGATG ACCTGCACTC CAGTGATTAG AAGTCCAGCC TTACCTGTGA GAGAGCTCCT 

2213 2223 2233 2243 2253 2263 2273 

GGCATCTAAG AGAACTCTGC TTCCTCATCA TCCCCACCGA CTTGTTCTTC CTTGGGAGTG TGTCCTCAGG 

2283 2293 2303 2313 2323 2333 2343 

GAGAACAGCA TTGCTGTTCC TGTGCCTCAA GCTCCCAGCT GACTCTCCTG TGG CT CAT AG GACTGAATGG 

2353 2363 2373 2383 2393 2403 2413 

GGTGAGGAAG AGCCTGATGC CCTCTGGCAA TCAGAGCCCG AAGGACTTCA AAACATCTGG ACAACTCTCA 
2423 2433 2443 

TTGACTGATG CTCCAACATA ATTTTTAAAA AGAG 
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Figure 2 

10 20 30 40 50 60 70 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA GGTGAGTGTG 

80 90 100 110 120 130 140 

GCATCCGAGC TGAGGGACGC GAG CCT GAGA CGCCGCTGCT GCTCCGGCTG AGTATCTAGC TTGTCTCCCC 

150 160 170 180 190 200 210 

GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC ACAGTCCCCG GCCCTCGCCC AGGTTCACTG 

220 230 240 250 260 270 280 

CAACCGTTCA GAGGTCCCCA GGAGCTGCTG CTGG CGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC 

290 300 310 320 330 340 350 

GTAGTGCCAT CCCGAGCAAC GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GC AAGTTTGT TCAAGATTGG 

360 370 380 390 400 (1) 

CTGTCAAGAA TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT 

MET He Pro 

417 432 447 462 

GGT AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC GCG 
Gly Asn Arg MET Leu MET Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly Ala 

477 492 507 

AGC CAT GCT ACT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC GAG ATT GAG 
Ser His Ala Ser Leu He Pro Glu Thr Gly Lys Lys Lys Val Ala Glu lie Gin 

522 537 552 567 

GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG CTC CTG CGG GAC TTC 

Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu Leu Leu Arg Asp Phe 

582 597 612 627 

GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC CGC CGC CCG CAG CCT AGC AAG 
Glu Ala Thr Leu Leu Gin MET Phe Gly Leu Arg Arg Arg Pro Gin Pro Ser Lys 

642 657 672 

AGT GCC GTC ATT CCG GAC TAC ATG CGG GAT CTT TAC CGG CTT CAG TCT GGG GAG 
Ser Ala Val^Ile Pro Asp Tyr MET Arg Asp Leu Tyr Arg Leu Gin Ser Gly Glu 

687 702 717 732 

GAG GAG GAA GAG CAG ATC CAC AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC 
Glu Glu Glu Glu Gin He His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala 

747 762 777 

AGC CGG GCC AAC ACC GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC 
Ser Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn lie 

792 807 822 837 

CCA GGG ACC AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC 

Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser lie 
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Figure 2A 

352 867 882 897 

CCT GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG GTG 
Pro Glu Asn Glu Val lie Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gin Val 

912 927 942 

GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT TAT GAG GTT 
Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His Arg lie Asn lie Tyr Glu val 

957 972 987 1002 

ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC ACA CGA CTA CTG GAC 
MET Lys Pro Pro Ala Glu Val Val Pro Gly His Leu lie Thr Arg Leu I^u Asp 

1017 1032 1047 

ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG GAA ACT TTT GAT GTG AGC CCT 
Thr Arg Leu Val His His Asn Val Thr Arg Trp Glu Thr Phe Asp Val ser Fro 

1062 1077 1092 1107 

GCG GTC CTT CGC TGG ACC CGG GAG AAG CAG CCA AAC TAT GGG CTA GCC ATT GAG 

Ala Val Leu Arg Trp Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala lie Glu 

1122 1137 1152 1167 

GTG ACT CAC CTC CAT CAG ACT CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC 
Val Thr His Leu His Gin Thr Arg Thr His Gin Gly Gin His Val Arg lie Ser 

1182 1197 1212 

CGA TCG TTA CCT CAA GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC 
Arg Ser Leu Pro Gin Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu Val 

1227 1242 1257 1272 

ACC TTT GGC CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG 
Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala X,ys 

1287 1302 1317 

CGT AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC CGG 
Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys Arg 

1332(311) 1347 1362 1377 

CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC TGG ATT GTG 

Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp He Val 

1392 1407 1422 1437 

GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC TGC CCC TTT CCA CTG 
Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly Asp Cys Pro Phe Pro I>eu 

1452 1467 1482 

GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT GTG CAG ACC CTG GTC AAT TCT 
Ala Asp His Leu Asn Ser Thr Asn His Ala He Val Gin Thr Leu Val Asn Ser 

1497 1512 1527 1542 

GTC AAT TCC AGT ATC CCC AAA GCC TGT TGT GTG CCC ACT GAA CTG AGT GCC ATC 
Val Asn Ser Ser He Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala He 
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Figure 2B 

1557 1572 15S7 

lib S S5 gig Gly gj gJ—S! — CSS! GGATAGACAG 
ATA*ACA^«CACACa£^ 

ACAGACTGCT TCCTTAtIgc TGGACT^A TTTAAAAA^f AAA&ARA^AATGGA^^TCCCIAaII 6 

ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATCSKT TTGACCAMwf TGATCAraS^TTTTGACABA. 6 

ATATATTTAT AACTACG^AT TAAAAGAAAA AAATAAAATG AGTCATTATT TTAAAAA^AAAAAAA^CT 
1946 

CTAGAGTCGA CGGAATTC 
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Figure 3 



TGA ACA ASA GAG TGC TCA AGA AGC TGT CCA AGG ACG GCT CCA GAG AGG 48 

* Thr Arg Glu Cys Ser Arg Ser Cys Pro Arg Thr Ala Pro Gin Arg 
-41 -40 -35 -30 

GAG GTG AGA GCA GTC ACG AGG AGG ACA CGG ATG GCG CAC GTG GCT GCG 96 
Gin Val Arg Ala Val Thr Arg Arg Thr Arg Met Ala His Val Ala Ala 
-25 -20 -15 -10 

GGG TCG ACT TTA GCC AGG CGG AAA AGG AGC GCC GGG GCT GGC AGC CAC 144 
Gly Ser Thr Leu Ala Arg Arg Lys Arg Ser Ala Gly Ala Gly Ser His 
-5 15 

TGT CAA AAG ACC TCC CTG CGG GTA AAC TTC GAG GAC ATC GGC TGG GAC 192 
Cys Gin Lys Thr Ser Leu Arg Val Asn Phe Glu Asp He Gly Trp Asp 
10 15 20 

AGC TGG ATC ATT GCA CCC AAG GAG TAT GAA GCC TAC GAG TGT AAG GGC 240 
Ser Trp He He Ala Pro Lys Glu Tyr Glu Ala Tyr Glu Cys Lys Gly 
25 30 35 

GGC TGC TTC TTC C.CC TTG GCT GAC GAT GTG ACG CCG ACG AAA CAC GCT 268 
Gly Cys Phe Phe Pro Leu Ala Asp Asp Val Thr Pro Thr Lys His Ala 
40 45 50 55 

ATC GTG CAG ACC CTG GTG CAT CTC AAG TTC CCC ACA AAG GTG GGC AAG 336 
He Val Gin Thr Leu Val His Leu Lys Phe Pro Thr Lys Val Gly Lys 

60 65 70 

GCC TGC TGT GTG CCC ACC AAA CTG AGC CCC ATC TCC GTC CTC TAC AAG 384 
Ala Cys Cys Val Pro Thr Lys Leu Ser Pro lie Ser Val Leu Tyr Lys 
75 80 85 

GAT GAC ATG GGG GTG CCC ACC CTC AAG TAC CAT TAC GAG GGC ATG AGC 432 
Asp Asp Met Gly Val Pro Thr Leu Lys Tyr His Tyr Glu Gly Met Ser 

90 95 100 

GTG GCA GAG TGT GGG TGC AGG TAGTATCTGC CTGCGGG 470 
Val Ala Glu Cys Gly Cys Arg 
105 110 
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